RAW SEQUENCE LISTING                DATE: 05/22/2003
PATENT APPLICATION: US/09/830,502A  TIME: 11:10:39

Input Set : A:\C26151.app
Output Set: N:\CRF4\05222003\l830502A.raw

199 attgaggaga ggacgggcaa ggacccaagg gccctcacgg cctag 2025

202 <210> SEQ ID NO: 3
203 <211> LENGTH: 20
204 <212> TYPE: DNA
205 <213> ORGANISM: Artificial Sequence
207 <220> FEATURE:
208 <223> OTHER INFORMATION: Description of Artificial Sequence: probe or
209       primer
211 <220> FEATURE:
212 <221> NAME/KEY: tRNA
213 <222> LOCATION: (4)
214 <223> OTHER INFORMATION: w at position 4 can be T or A
216 <220> FEATURE:
217 <221> NAME/KEY: unsure
218 <222> LOCATION: (5)
219 <223> OTHER INFORMATION: s at position 5 can be C or G
221 <220> FEATURE:
222 <221> NAME/KEY: unsure
223 <222> LOCATION: (12)
224 <223> OTHER INFORMATION: s at position 12 can be C or G
226 <220> FEATURE:
227 <221> NAME/KEY: unsure
228 <222> LOCATION: (15)
229 <223> OTHER INFORMATION: r at position 15 can be G or A
231 <220> FEATURE:
232 <221> NAME/KEY: unsure
233 <222> LOCATION: (18)
234 <223> OTHER INFORMATION: y at position 18 can be T or C
236 <400> SEQUENCE: 3
237 atcwscgacg csgartayga 20

240 <210> SEQ ID NO: 4
241 <211> LENGTH: 7
242 <212> TYPE: PRT
243 <213> ORGANISM: Artificial Sequence
245 <220> FEATURE:
246 <223> OTHER INFORMATION: Description of Artificial Sequence
249 <400> SEQUENCE: 4
250 Ile Ser Asp Ala Glu Tyr Asp
251 1               5

254 <210> SEQ ID NO: 5
255 <211> LENGTH: 20
256 <212> TYPE: DNA
257 <213> ORGANISM: Artificial Sequence
259 <220> FEATURE:
260 <223> OTHER INFORMATION: Description of Artificial Sequence: probe or
261       primer
263 <220> FEATURE:
264 <221> NAME/KEY: unsure
265 <222> LOCATION: (3)
266 <223> OTHER INFORMATION: s at position 3 can be C or G
268 <220> FEATURE:
269 <221> NAME/KEY: unsure
270 <222> LOCATION: (6)
271 <223> OTHER INFORMATION: s at position 6 can be C or G
273 <220> FEATURE:
274 <221> NAME/KEY: unsure
275 <222> LOCATION: (8)
276 <223> OTHER INFORMATION: k at position 8 can be G or T
278 <220> FEATURE:
279 <221> NAME/KEY: unsure
280 <222> LOCATION: (9)
281 <223> OTHER INFORMATION: s at position 9 can be G or C
283 <220> FEATURE:
284 <221> NAME/KEY: unsure
285 <222> LOCATION: (12)
286 <223> OTHER INFORMATION: s at position 12 can be G or C
288 <220> FEATURE:
289 <221> NAME/KEY: unsure
290 <222> LOCATION: (15)
291 <223> OTHER INFORMATION: y at position 15 can be C or T
293 <220> FEATURE:
294 <221> NAME/KEY: unsure
295 <222> LOCATION: (18)
296 <223> OTHER INFORMATION: r at position 18 can be A or G
298 <400> SEQUENCE: 5
299 ccsgtscksc csacytgraa 20
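
The feature entries above spell out, one position at a time, which bases each ambiguity code stands for. As a minimal sketch (not part of the filed listing), the Python below uses the standard IUPAC meanings of the codes that appear in this listing to locate the degenerate positions of a primer and to enumerate the unambiguous sequences it covers. The names IUPAC, expand and degenerate_positions are illustrative assumptions.

```python
from itertools import product

# IUPAC nucleotide codes that occur in this listing (lowercase, as printed).
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "w": "at", "s": "cg", "k": "gt", "r": "ag", "y": "ct", "v": "acg",
}

def degenerate_positions(primer):
    """Return the 1-based positions that hold an ambiguity code."""
    return [i for i, base in enumerate(primer, start=1) if len(IUPAC[base]) > 1]

def expand(primer):
    """Return every unambiguous sequence covered by a degenerate primer."""
    return ["".join(combo) for combo in product(*(IUPAC[b] for b in primer))]

# SEQ ID NO: 5 as printed above, with the block spacing removed.
seq5 = "ccsgtscksccsacytgraa"
print(degenerate_positions(seq5))  # [3, 6, 8, 9, 12, 15, 18]
print(len(expand(seq5)))           # 128
```

The positions reported for SEQ ID NO: 5 match the <222> LOCATION entries of its feature blocks, which is a quick way to cross-check a transcribed primer against its annotations.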

302 <210> SEQ ID NO: 6
303 <211> LENGTH: 20
304 <212> TYPE: DNA
305 <213> ORGANISM: Artificial Sequence
307 <220> FEATURE:
308 <223> OTHER INFORMATION: Description of Artificial Sequence: probe or
309       primer
311 <220> FEATURE:
312 <221> NAME/KEY: unsure
313 <222> LOCATION: (9)
314 <223> OTHER INFORMATION: v at position 9 can be C, G, or A
316 <220> FEATURE:
317 <221> NAME/KEY: unsure
318 <222> LOCATION: (11)
319 <223> OTHER INFORMATION: r at position 11 can be A or G
321 <220> FEATURE:
322 <221> NAME/KEY: unsure
323 <222> LOCATION: (12)
324 <223> OTHER INFORMATION: y at position 12 can be T or C
326 <220> FEATURE:
327 <221> NAME/KEY: unsure
RAW SEQUENCE LISTING ERROR SUMMARY  DATE: 05/22/2003
PATENT APPLICATION: US/09/830,502A  TIME: 11:10:40

Input Set : A:\C26151.app
Output Set: N:\CRF4\05222003\l830502A.raw



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 
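
The note above describes a simple consistency rule: any sequence that contains n or Xaa should carry an explanation in its <220> to <223> fields. The Python below is a minimal sketch of that rule under stated assumptions; it is not the checker that produced this report. The record layout (keys "type", "sequence", "other_information") and the function names are hypothetical.

```python
def uses_placeholder(sequence, seq_type):
    """True if the sequence contains 'Xaa' (PRT) or 'n' (DNA/RNA)."""
    if seq_type == "PRT":
        return "Xaa" in sequence.split()
    return "n" in sequence.replace(" ", "").lower()

def placeholder_positions(sequence, seq_type):
    """Return 1-based positions of each placeholder, as an error summary lists them."""
    if seq_type == "PRT":
        return [i for i, unit in enumerate(sequence.split(), 1) if unit == "Xaa"]
    bases = sequence.replace(" ", "").lower()
    return [i for i, base in enumerate(bases, 1) if base == "n"]

def lacks_explanation(record):
    """True when a record uses n/Xaa but has no non-empty <223> text."""
    explained = any(text.strip() for text in record.get("other_information", []))
    return uses_placeholder(record["sequence"], record["type"]) and not explained

# Hypothetical record, for illustration only.
record = {"type": "PRT", "sequence": "Met Xaa Xaa Leu", "other_information": []}
print(placeholder_positions(record["sequence"], record["type"]))  # [2, 3]
print(lacks_explanation(record))                                  # True
```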

Seq#:15; Xaa Pos. 18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36
Seq#:15; Xaa Pos. 37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55
Seq#:15; Xaa Pos. 56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74
Seq#:15; Xaa Pos. 75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93
Seq#:15; Xaa Pos. 94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109
Seq#:15; Xaa Pos. 110,111,112,113,114,115,116,117,118,119,120,126,127,128
Seq#:15; Xaa Pos. 129,130,131,132,133,134,135,136,137,138,139,140,141,142
Seq#:15; Xaa Pos. 143,144,145,146,147,148,149,150,151,152,153,154,155,156
Seq#:15; Xaa Pos. 157,158,159,160,161,162,163,164,165,166,167,168,169,170
Seq#:15; Xaa Pos. 171,172
Seq#:16; Xaa Pos. 18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36
Seq#:16; Xaa Pos. 37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55
Seq#:16; Xaa Pos. 56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74
Seq#:16; Xaa Pos. 75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93
Seq#:16; Xaa Pos. 94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109
Seq#:16; Xaa Pos. 110,111,112,113,114,115,116,117,118,119,120,129,130,131
Seq#:16; Xaa Pos. 132,133,134,135,136,137,138,139,140,141,142,143,144,145
Seq#:16; Xaa Pos. 146,147,148,149,150,151,152,153,154,155,156,157,158,159
Seq#:16; Xaa Pos. 160,161,162,163,164,165,166,167,168,169,170,171,172,173
Seq#:16; Xaa Pos. 174,175
Seq#:17; Xaa Pos. 18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36
Seq#:17; Xaa Pos. 37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55
Seq#:17; Xaa Pos. 56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74
Seq#:17; Xaa Pos. 75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93
Seq#:17; Xaa Pos. 94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109
Seq#:17; Xaa Pos. 110,111,112,113,114,115,116,117,118,119,120,126,127,128
Seq#:17; Xaa Pos. 129,130,131,132,133,134,135,136,137,138,139,140,141,142
Seq#:17; Xaa Pos. 143,144,145,146,147,148,149,150,151,152,153,154,155,156
Seq#:17; Xaa Pos. 157,158,159,160,161,162,163,164,165,166,167,168,169,170
Seq#:17; Xaa Pos. 171,172
Seq#:18; Xaa Pos. 18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36
Seq#:18; Xaa Pos. 37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55
Seq#:18; Xaa Pos. 56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74
Seq#:18; Xaa Pos. 75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93
Seq#:18; Xaa Pos. 94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109
Seq#:18; Xaa Pos. 110,111,112,113,114,115,116,117,118,119,120,126,127,128
Seq#:18; Xaa Pos. 129,130,131,132,133,134,135,136,137,138,139,140,141,142
Seq#:18; Xaa Pos. 143,144,145,146,147,148,149,150,151,152,153,154,155,156
Seq#:18; Xaa Pos. 157,158,159,160,161,162,163,164,165,166,167,168,169,170
Seq#:18; Xaa Pos. 171,172
Seq#:19; Xaa Pos. 18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36
Seq#:19; Xaa Pos. 37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55
Seq#:19; Xaa Pos. 56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74
Seq#:19; Xaa Pos. 75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93
Seq#:19; Xaa Pos. 94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109
Seq#:19; Xaa Pos. 110,111,112,113,114,115,116,117,118,119,120,126,127,128
Seq#:19; Xaa Pos. 129,130,131,132,133,134,135,136,137,138,139,140,141,142
Seq#:19; Xaa Pos. 143,144,145,146,147,148,149,150,151,152,153,154,155,156
Seq#:19; Xaa Pos. 157,158,159,160,161,162,163,164,165,166,167,168,169,170
Seq#:19; Xaa Pos. 171,172
Seq#:20; Xaa Pos. 18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36
VERIFICATION SUMMARY                DATE: 05/22/2003
PATENT APPLICATION: US/09/830,502A  TIME: 11:10:40

Input Set : A:\C26151.app
Output Set: N:\CRF4\05222003\l830502A.raw

L:470 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:15 after pos.:16
M:341 Repeated in SeqNo=15
L:523 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:16 after pos.:16
M:341 Repeated in SeqNo=16
L:576 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:17 after pos.:16
M:341 Repeated in SeqNo=17
L:629 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:18 after pos.:16
M:341 Repeated in SeqNo=18
L:682 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:19 after pos.:16
M:341 Repeated in SeqNo=19
L:735 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:20 after pos.:16
M:341 Repeated in SeqNo=20
STATISTICS SUMMARY                  DATE: 05/22/2003
PATENT APPLICATION: US/10/085,418E  TIME: 14:24:44

Input Set : N:\jumbos\10085418\PTOMS.txt
Output Set: N:\CRF4\05222003\J085418E.raw

Application Serial Number: US/10/085,418E

Alpha or Numeric or Xml : Alpha 

Application Class: 

Application File Date: 02-28-2002 

Art Unit: OIPE 

Software Application: PatentINl.0 

Total Number of Sequences: 3 

Total Nucleotides: 3729 

Total Amino Acids: 0 

Number of Errors : 0 

Number of Warnings: 0 

Number of Corrections: 2 

MESSAGE SUMMARY 

220 C: 2 (Keyword misspelled or invalid format) 

RAW SEQUENCE LISTING                DATE: 05/22/2003
PATENT APPLICATION: US/09/830,502A  TIME: 11:10:39

Input Set : A:\C26151.app
Output Set: N:\CRF4\05222003\l830502A.raw

3 <110> APPLICANT: Barany, Francis
4       Cao, Weiguo
5       Tong, Jie
7 <120> TITLE OF INVENTION: HIGH FIDELITY THERMOSTABLE LIGASE AND
9 <130> FILE REFERENCE: 19603/2615
11 <140> CURRENT APPLICATION NUMBER: 09/830,502A
12 <141> CURRENT FILING DATE: 1999-10-29
14 <150> PRIOR APPLICATION NUMBER: 60/106,461
15 <151> PRIOR FILING DATE: 1998-10-30
17 <150> PRIOR APPLICATION NUMBER: PCT/US99/25437
18 <151> PRIOR FILING DATE: 1999-10-29
20 <160> NUMBER OF SEQ ID NOS: 20
22 <170> SOFTWARE: PatentIn Ver. 2.1

24 <210> SEQ ID NO: 1
25 <211> LENGTH: 674
26 <212> TYPE: PRT
27 <213> ORGANISM: Thermus sp.
29 <400> SEQUENCE: 1
30  Met Thr Leu Glu Glu Ala Arg Arg Arg Val Asn Glu Leu Arg Asp Leu
31  1               5                   10                  15
33  Ile Arg Tyr His Asn Tyr Leu Tyr Tyr Val Leu Asp Ala Pro Glu Ile
34              20                  25                  30
36  Ser Asp Ala Glu Tyr Asp Arg Leu Leu Arg Glu Leu Lys Glu Leu Glu
37          35                  40                  45
39  Glu Arg Phe Pro Glu Leu Lys Ser Pro Asp Ser Pro Thr Glu Gln Val
40      50                  55                  60
42  Gly Ala Arg Pro Leu Glu Ala Thr Phe Arg Pro Val Arg His Pro Thr
43  65                  70                  75                  80
45  Arg Met Tyr Ser Leu Asp Asn Ala Phe Ser Leu Asp Glu Val Arg Ala
46                  85                  90                  95
48  Phe Glu Glu Arg Ile Glu Arg Ala Leu Gly Arg Lys Gly Pro Phe Leu
49              100                 105                 110
51  Tyr Thr Val Glu Arg Lys Val Asp Gly Leu Ser Val Asn Leu Tyr Tyr
52          115                 120                 125
54  Glu Glu Gly Ile Leu Val Phe Gly Ala Thr Arg Gly Asp Gly Glu Thr
55      130                 135                 140
57  Gly Glu Glu Val Thr Gln Asn Leu Leu Thr Ile Pro Thr Ile Pro Arg
58  145                 150                 155                 160
60  Arg Leu Thr Gly Val Pro Asp Arg Leu Glu Val Arg Gly Glu Val Tyr
61                  165                 170                 175
63  Met Pro Ile Glu Ala Phe Leu Arg Leu Asn Gln Glu Leu Glu Glu Ala
64              180                 185                 190
66  Gly Glu Arg Ile Phe Lys Asn Pro Arg Asn Ala Ala Ala Gly Ser Leu
67          195                 200                 205
69  Arg Gln Lys Asp Pro Arg Val Thr Ala Arg Arg Gly Leu Arg Ala Thr
70      210                 215                 220
72  Phe Tyr Ala Leu Gly Leu Gly Leu Glu Glu Thr Gly Leu Lys Ser Gln
73  225                 230                 235                 240
75  His Asp Leu Leu Leu Trp Leu Arg Glu Arg Gly Phe Pro Val Glu His
76                  245                 250                 255
78  Gly Phe Thr Arg Ala Leu Gly Ala Glu Gly Val Glu Glu Val Tyr Gln
79              260                 265                 270
81  Ala Trp Leu Lys Glu Arg Arg Lys Leu Pro Phe Glu Ala Asp Gly Val
82          275                 280                 285
84  Val Val Lys Leu Asp Asp Leu Ala Leu Trp Arg Glu Leu Gly Tyr Thr
85      290                 295                 300
87  Ala Arg Thr Pro Arg Phe Ala Leu Ala Tyr Lys Phe Pro Ala Glu Glu
88  305                 310                 315                 320
90  Lys Glu Thr Arg Leu Leu Ser Val Ala Phe Gln Val Gly Arg Thr Gly
91                  325                 330                 335
93  Arg Ile Thr Pro Val Gly Val Leu Glu Pro Val Phe Ile Glu Gly Ser
94              340                 345                 350
96  Glu Val Ser Arg Val Thr Leu His Asn Glu Ser Phe Ile Glu Glu Leu
97          355                 360                 365
99  Asp Val Arg Ile Gly Asp Trp Val Leu Val His Lys Ala Gly Gly Val
100     370                 375                 380
102 Ile Pro Glu Val Leu Arg Val Leu Lys Glu Arg Arg Thr Gly Glu Glu
103 385                 390                 395                 400
105 Lys Pro Ile Ile Trp Pro Glu Asn Cys Pro Glu Cys Gly His Ala Leu
106                 405                 410                 415
108 Ile Lys Glu Gly Lys Val His Arg Cys Pro Asn Pro Leu Cys Pro Ala
109             420                 425                 430
111 Lys Arg Phe Glu Ala Ile Arg His Tyr Ala Ser Arg Lys Ala Met Asp
112         435                 440                 445
114 Ile Gln Gly Leu Gly Glu Lys Leu Ile Glu Lys Leu Leu Glu Lys Gly
115     450                 455                 460
117 Leu Val Arg Asp Val Ala Asp Leu Tyr Arg Leu Lys Lys Glu Asp Leu
118 465                 470                 475                 480
120 Val Asn Leu Glu Arg Met Gly Glu Lys Ser Ala Glu Asn Leu Leu Arg
121                 485                 490                 495
123 Gln Ile Glu Glu Ser Lys Gly Arg Gly Leu Glu Arg Leu Leu Tyr Ala
124             500                 505                 510
126 Leu Gly Leu Pro Gly Val Gly Glu Val Leu Ala Arg Asn Leu Ala Leu
127         515                 520                 525
129 Arg Phe Gly His Met Asp Arg Leu Leu Glu Ala Gly Leu Glu Asp Leu
130     530                 535                 540
132 Leu Glu Val Glu Gly Val Gly Glu Leu Thr Ala Arg Ala Ile Leu Asn
133 545                 550                 555                 560
135 Thr Leu Lys Asp Pro Glu Phe Arg Asp Leu Val Arg Arg Leu Lys Glu
136                 565                 570                 575
138 Ala Gly Val Glu Met Glu Ala Lys Glu Arg Glu Gly Glu Ala Leu Lys
139             580                 585                 590
141 Gly Leu Thr Phe Val Ile Thr Gly Glu Leu Ser Arg Pro Arg Glu Glu
142         595                 600                 605
144 Val Lys Ala Leu Leu Arg Arg Leu Gly Ala Lys Val Thr Asp Ser Val
145     610                 615                 620
147 Ser Arg Lys Thr Ser Phe Leu Val Val Gly Glu Asn Pro Gly Ser Lys
148 625                 630                 635                 640
150 Leu Glu Lys Ala Arg Ala Leu Gly Val Pro Thr Leu Ser Glu Glu Glu
151                 645                 650                 655
153 Leu Tyr Arg Leu Ile Glu Glu Arg Thr Gly Lys Asp Pro Arg Ala Leu
154             660                 665                 670
156 Thr Ala

160 <210> SEQ ID NO: 2
161 <211> LENGTH: 2025
162 <212> TYPE: DNA
163 <213> ORGANISM: Thermus sp.
165 <400> SEQUENCE: 2
166 atgaccctag aggaggcccg caggcgcgtc aacgaactca gggacctgat ccgttaccac 60
167 aactacctct attacgtctt ggacgccccc gagatctccg acgccgagta cgaccggctc 120
168 cttagggagc ttaaggagct ggaggagcgc tttcccgagc tcaaaagccc cgactccccc 180
169 acggaacagg tgggggcgag gcctctggag gccaccttcc gcccggtgcg ccaccccacc 240
170 cgcatgtact ccctggacaa cgccttttcc ttggacgagg tgagggcctt tgaggagcgc 300
171 atagagcggg ccctggggcg gaaggggccc ttcctctaca ccgtggagcg caaggtggac 360
172 ggtctttccg tgaacctcta ctacgaggag ggcatcctcg tctttggggc cacccggggc 420
173 gacggggaga ccggggagga ggtgacccag aacctcctca ccatccccac cattccccgc 480
174 cgcctcacgg gcgttccgga ccgcctcgag gtccggggcg aggtctacat gcccatagag 540
175 gccttcctca ggctcaacca ggagctggag gaggcggggg agcgcatctt caaaaacccc 600
176 aggaacgccg ccgccgggtc cttgcggcag aaagacccca gggtcacggc caggcggggc 660
177 ctgagggcca ccttttacgc cctggggctg ggcctggagg aaaccgggtt aaaaagccag 720
178 cacgaccttc tcctatggct aagagagcgg ggctttcccg tggagcacgg ctttacccgg 780
179 gccctggggg cggagggggt ggaggaggtc taccaggcct ggctcaagga gaggcggaag 840
180 cttccctttg aggccgacgg ggtggtggtc aagctggacg acctcgccct ctggcgggag 900
181 ctggggtaca ccgcccgcac cccccgcttc gccctcgcct acaagttccc ggccgaggag 960
182 aaggagaccc gcctcctctc cgtggccttc caggtggggc ggaccgggcg catcaccccc 1020
183 gtgggcgttc tggagcccgt cttcatagag ggcagcgagg tgagccgggt caccctccac 1080
184 aacgagagct tcattgagga gctggacgtg cgcatcggcg actgggtgct ggtccacaag 1140
185 gcgggcgggg tgattcccga ggtgctgagg gtcctgaaag agcgccgcac [illegible] 1200
186 aagcccatca tctggcccga gaactgcccc gagtgcggcc acgccctcat caaggagggg 1260
187 aaggtccacc gctgccccaa ccccttgtgc cccgccaagc gctttgaggc catccgccac 1320
188 tacgcctccc gcaaggccat ggacatccag ggcctggggg agaagctcat agaaaagctt 1380
189 ctggaaaagg gcctggtccg ggacgtggcc gacctctacc gcctgaagaa ggaggacctg 1440
190 gtgaacctgg agcgcatggg ggagaagagc gcagagaacc tcctccgcca gatagaggag 1500
191 agcaagggcc gcggcctgga gcgcctcctt tacgccctgg gccttcccgg ggtgggggag 1560
192 gtgctggccc ggaacctggc cctccgcttc ggccacatgg accgccttct ggaggcgggc 1620
193 ctcgaggacc tcctggaggt ggagggggtg ggcgagctca ccgcccgggc catcctgaat 1680
194 accctaaagg acccggagtt ccgggacctg gtgcgccgcc tgaaggaggc cggggtggag 1740
195 atggaggcca aagagcggga gggcgaggcc ttgaaggggc tcaccttcgt catcaccggg 1800
196 gagctttccc ggccccggga ggaggtgaag gccctcctta ggcggcttgg ggccaaggtg 1860
197 acggactcgg tgagccgcaa gacgagcttc ctggtggtgg gggagaaccc ggggagcaag 1920
198 ctggaaaagg cccgcgcctt gggggtcccc accctgagcg aggaggagct ctaccgcctc 1980